Through LKED with Gun and. Camefa: 
Object/Load Modules, Link: Editors, 
Loaders, and What They Do for You, 


SHARE 83, Session 4813 


John R. Ehrman 


IBM Santa Teresa Laboratory 
555 Bailey Avenue, B16/D3 
San Jose, CA 95141 * 
1-408-463-3543 
EHRMAN@VNET.IBM.COM or USIBM9WH@IBMMAIL. COM] 


© IBM Corporation 1994 


10 Aug 94 


Table of Contents Contents-1 
SP RT A ET SR PTE ND IO ET ST SS EAN SN PETES | 
IntrOGUCTION: 66.058 bce. Vas 4 verre ee Tore ME Gia Pine BO eR sci selersatlaele Gos Baw enone ec 
Topic Overview .....:.. es lsen lh seb tatavra aoe Hex erie: tab eels a far ane cas wa tee i sie aac ae de whic ca ed 2 
What Happens to Your Program? ee ee ere ig ec ee Set pare aie eset bean Be obs Haas wee abe elas: he i 
Why is Linking Needed? ................ Pe eee se eee eee ny eee eae ee ore 6 
Putting the Pieces Back Together ........... hes Sh iE eee ates Rte Osh eee td es ite ats 7 
Some General Definitions ........... as eae Rieadiaueeenw a hos etn tueeoes Geeceu ide o 
Translator Cutput: Object Modules ....:.....020000. oo dere esas Se ae ee ee “44. = 
Some IBM-Specific Definitions ........ 0... ee eee ene Hee om 2s oto hatiat 8 oles 
Translator Output: The Object Module .................. re re Pere ee ee iste elects 13 
Object Module External Symbol Dictionary (ESD) ....... 0.0... 0. ce ccc ee eee 14 
Origins of External Symbol Dictionary Items ......... il ta BG Sa tecere ie datatens Lehre ein cath Oe 
Object Module Machine-Language Text (TXT) ...........50008% Soa Wik aes, Gee a Geass: LO 
_ Object Module Relocation Dictionary (RLD) ..... AE aes aes cs sctap cathe Ce 2s arse fea wok ites Pee ree 17 
Object Module Internal Symbol Dictionary (SYM) Pe ee re oy ee eee Se ee or 618 
Object Module End-of-Module (END) ......... 0... ce eee ee ee eee eee ig eae: ia 19°. 
Other Object Module Records (CMS) ...........00005 Cee ree Ne ere elena eden 20 
Combining Object Modules with the Batch Loader 2.2... cc cc eee ee ee tet ee eee eae 21 
Combining Object Modules: a Simple Example ................005. ft debbie Stes ve on ae 22 
Combining Object Modules: First Object Module ......... MRS Wei Beh Sh Oss, wet Lace a Gartehas 23 
Combining Object Modules: Second Object Module ................. Geet, Seat ated se raves a 24 
Combining Object Modules: Batch Loader Actions ............ 0. cee eee ee eee ees 25 


acre pesca oaeaeticrcenrc ee cieeiemppeetcne aoe sesoomeseeaenene soocineiear en ereareeeatiacmmneaniwasredieenas nar eieaac toma eS aR 
Linkage Editing, Loading, Object & Load Modules SHARE 83, Session 4812 
© IBM Corporation 1994 10 Aug 94 


Table of Contents Contents-2 


Se i STE ST STE NESE SE SS TE ES 


Combining Object Modules: Resulting Program 


Saving Linked Programs: Load Modules 
What and Why are Load Modules? 
What Is In a Load Module? 

Schematic of a “Normal” Load Module 
Linkage Editor Inputs and Outputs 
Linkage Editor Processing 
Pseudo-Registers 

Peculiarities of Load Modules 


Overlay Modules 
What and Why are Overlays? 
Example of an Overlay Structure 
Arranging an Overlay Structure 
An Overlay Structure In More Detail 


Bringing Load Modules into Storage: Program Fetch 
Program Fetch -- A Relocating Loader 


Looking Backward 
Some History 
Assumptions and Constraints on 1963 Designs 
Limitations and Extensions 


Linkage Editing, Loading, Object & Load Modules SHARE 83, Session 4812 | 
© IBM Corporation 1994 10 Aug 94 


Table of Contents Contents-3 


_ a a a aa A aS ST ase S aS 


What Are The Problems? | 


‘Looking Forward .......cceccceecceecceecccececeeuaes bavi Resyedeee eae ee eee ee. 
The DFSMS/MVS Binder ahd Loader | ae ree Se ee ee CO ner ere ane Cre 53 
Simian. Ls elutes thd eer at tay hae ene caret S ese eaaese ateeneete eee bite here aches 54 
What We’ve Discussed ........ ares Beh oe NEE te cee gees wl cussed od teat ay eer an Pee a ee 55 
ROICTCNGCES® (454 & ei nee Se ee Ee de eas SE eh oh os Bw ak Bee ee se ee 56 
Linkage Editing, Loading, Object & Load Modules SHARE 83, Session 4812 


© IBM Corporation 1994 10 Aug 94 


. 


Introduction 
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Topic Overview | 2 
RS SESSA OS Pa RS SEE SE SSS CAE TSE CE EGA RE TT ET TS SS ST SET TST ICES ESR RTSSOT | 


, . What happens to programs “on the way to execution” 

e Why program linking is needed. a | 

e What assemblers and compilers chica: object modules 
© What program linking does with object modules 

¢ Saving the results of linking: load modules 

¢ What happens when load modules: are put into storage 

e Why the Linkage Editor and Loader are the way they are 


¢ The future: the good things the new Binder does for you 
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What Happens to Your Program? 


1. The Beginner’s View 


The COMPUTER 


My Prograin F Something Magical Happens My Output 


2. The After-a-Little-Experience View 


The COMPUTER 


A Source Language Peculiar and 
Program Compiler ft Mysterious My Output 
Happens Here 


eae [ 
| Something Very 
> 


e We learn to distinguish between compile time and run time 
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What Happens to Your Program? ... 


3. The After-Some-More-Experience View. 


Source | _ | Language | Program 
Program Compiler Linker Execu- 
| tion 


Compile 


| Time 


_« We learn to distinguish among compile, link, and run times 
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What Happens to Your Program? ... 


4. Our View 


Linkage Load +>! Program | 
Source Editor Module Fetch Program 
Trans- ; j se 


>; Execution 
lator and Output 


Batch 
Loader 


Compile 
Time Link Time Run Time 


Our Concerns: The Program Linking Process 


e Our focus will be almost entirely on the five items in the “Link Time” box 


e We will refer to some compile-time and run-time topics and issues 


oo e=e=e=e=S———e_ee=S ———————————————————————_———_—————————————————————— lH 
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Why is Linking Needed? 6 


—eeeeeEeEeaeaEaeeEeEeEeEeeeeeeeeee=e=—===s=S—=S==———ee=S=S————eee————e=————e=—S————————>—€——=_=_=_=_—====_ 


¢ Anything that gets “big” is hard to manage | 


e@ The world’s oldest paradigm for handling big problems: 


— “Divide and Conquer”: break the problem into manageable pieces 


— Many dignified names have been given to this: Analysis, Modular 
_ Decomposition, Top-Down Analysis, Program Partitioning, Structured 
_ Programming... _ 


-— As your mother told you, | 
“Don’t try to eat that whole thing! Cut it into pieces first!” 


e Naturally leads to the question: | | 
— How do] put the divided and conquered pieces back together again? 
— “Synthesis” is the dignified name 


— As your mother told you, 
“If you took it apart, it’s up to you to put it back together!” 


¢ Program linking and loading are fundamental to any system 
— Linker capabilities (or shortcomings) have profound and widespread impacts 
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Putting the Pieces Back Together |. v4 
| A 2 ER RS TT SE PE EE TOES 
Putting the pieces back together (“binding”) can occur at many times 
— Compile time -- compile all needed items from source 
— Link Edit (pre-execution) time -- everything “bound” prior to execution 
— Program initiation time -- everything “bound” immediately prior to execution 


— Execution time -- pieces “bound” only if required 


Choice of “binding time” implies trade-offs: 


— Earlier times: efficiency vs. inflexibility 
— Later times: efficiency, flexibility, modifiability vs. costs 


— “Efficiency” is measured in many dimensions...! 


Program re-composition requires additional information: 
— A way to name the pieces to be bound 


— A way for the pieces to refer to one another 


er peerreneeeieseaeedesnceeasiricasniaeneesecipepeosiepnesnesancrsratessaeespnsemuaemsertoeegnnovemeovalesaasssseoneinicmmanoseneesssemsnsaresmas enema emnetneneierlactmennniaiaaninoeanne aenaameiaearaneeseannioas asetarpepoemneenaninaemestieeeeeapeseenammnamnee™ ered 
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Putting the Pieces Back Together ... 8 
eerste SSS Reenaslnsnaaaessi=n=ve=nssnssspeeesantl===as==ssensnpeasn=nntesss=earatannsnsnsnsnasserstansusssessnatsbusanansnantareaseuameasseree=smansmsnsn'rsssnnasssteeasmeeesssamar=neansnnsmenea=anrennersnnemeapaenaest 
¢ In this discussion: | 
— Information to assist with “re-composition” (cr “binding”) 
— External names: used to name the pieces to be bound 


— External names, address constants: let the pieces refer to one another 


¢ Our concerns, and the program re-composition tools involved: 
— Link-edit (pre-execution) time: Linkage Editor 
— Program initiation time: Batch Loader 


— Execution time: Operating System Program Fetch services 


e Understanding the pieces, and how they were bound 
— Link Editor and Batch Loader MAPs? AMBLIST? 


— DFSMS/MVS Binder is much more informative (more about this, later) 


——ooeEEEEEEEENEeeeQqqqqeeeeeeee 
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Some General Definitions 


Note: many of these terms are used quite flexibly in this industry... 


¢ Load, loading 
— Place a module into central storage 
¢ Link, linking 
— Resolve symbolic (external) names into offsets or addresses 
— Combine multiple (input) name spaces into a single (output) name space 
— Sometimes called “binding” (but that term is much more general) 
e Absolute loader 
— Places a module into storage at a fixed address, without relocating anything 
— Example: CMS’s “traditional” non-relocatable MODULEs 
e Relocate, relocation 


— Assign actual-storage or module-origin-relative addresses to address constants 


Tea ota See SSe aca cE SSeS aS aes | 
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Some General Definitions ... 10 
[SS Le RPA SALA TT ATS SL SE SEE ESE aE SS EE EE TE SE PSO RAT SPEED | 
e Relocating loader 


— Places modules into storage and updaies (relocates) addresses 2 their actual 
“ge, 33 
final” value | 


Example: Program Fetch, CMS Loader 
e Linker, Linkage Editor | 
oo Creates linked relocatable modules for ‘ater loading — 
- =F Example: Linkage Editor 
e Linking loader 
_— Places modules into storage with linking immediately p prior to program execution 
— Example: MVS Batch Loader 
¢ Dynamic loading | 
— Place modules into storage (with relocation) during program execution 


— Examples: parts of modules loaded by overlay, or modules loaded via LOAD, 
LINK, XCTL, ATTACH 


¢ Dynamic linking. 


— Place modules into storage with linking during program execution 
— Example: TSS 
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Translator Output: Object 


Modules 


e For the exciting details, see Appendix C of $C26-4941, 


High Level Assembler/MVS & VM & VSE Programmer's Guide 


_ a STS | 
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Some IBM-Specific Definitions 12 


¢ Control Section (CSECT, for short) © 


— <Acollection of program elements, all bearing fixed positional relationships to one 
another 


— A unit whose addressing and/or placement relative to all other Contro| Sections 
does not affect the program’s run-time logic 


~ The basic unit of program linking 
. — Types: Ordinary (CSECT), Read-Only (RSECT), Common (COM) 
¢ External Symbol (a “Public” symbol; internal symbols are “Private”) 
— A name known at-program linking time 
— A symbol whose value is intentionally not resolved at translation time 
¢ Address Constant (“Adcon”) 


— A field within a Control Section into which an actual address will be placed 
during program relocation 


¢ Pseudo-Register (or, External Dummy Section) 
— A special type of external symbol with a separate “name space” 
— More about these, later 
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Translator Output: The Object Module 


e 80-character (card-image) records, with 3-character ID in columns 2-4 
ESD External Symbol Dictionary 
TXT Machine Language instructions and data (“Text”) 
RLD Relocation Dictionary 
SYM Internal Symbols 
END End of Object Module, with IDR (Identification Record) data 
¢ One object module per Compilation Unit 


e “Batch” translations may produce multiple object modules 


—————————— NRC nNOS NS NNN 
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Object Module External Symbol Dictionary (ESD) 14 


rE 
¢ Describes external symbols (1 to 3 16-byte items per record) — 


e Numbered sequentially within each obiect module, starting at 1 
— The (16-bit) number is called the ESDID : 
o Four basic classes of external symbol: 
SD,CM Section Definition: the name of a control section. 
= Data: ESDID, length, section-origin address, AMODE. & RMODE. 
= Blank SD name sometimes called Private Code (PC) | 
— Common (CM) handled differently from SD items | 7 | 
Label Definition: the name of a pcsition at a fixed offset within 
a Control Section; typically, an Entry Point 
— Data: Address of the label, and ESDID of the section it’s in 
External Reference: the name of a symbol defined © 
“elsewhere” to which this module wants to refer 
— Data: ESDID 
PR =—~—C Pseudo Register: name ‘sf a Pseudo Register 


(the Assembler calls it XD, External Dummy Section) 
— Data: ESDID, PR length and alignment requirement 


e ESD records must appear first in each object module 


_ a SS TS SSS TE ST EL SSS PE SE Or PST SE ES ALLTEL | 
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Origins of External Symbol Dictionary Items © 


¢ ESD items originate in various language constructs, such as: 
ESD item Assembler | VS Fortran | OS PL/I VS COB. Il | C/370 ; 


Csect, Routine, Procedure | Outermost | R/W data 
Rsect Block Data program 
Com Common External 

static 
Extrn, Call, Call, data | Static Call | Call, data 
V-con Common reference | Literal reference 


File, Writable 
Fetchable, static 
Controlled 
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Object Module Machine-Language Text (TXT) 
cece reeset pes NSNEDESnnpan-o=rnsSuESNEspstnantsopnronsssNunuvaansssvaissn=snaisssnsanspsesssnsssnsessstansasnssinnsnsssssspsnsnpssannrermaspesntseniuanananannansnenaneanenasansassiseanrnrcnisanssoeieesuaseeasespl 


¢ Contains machine language instructions and data 
— Up to 56 bytes per record 
—¢ Data: 
1. How many bytes of text data are in this record 


2. ESDID of the control section it belongs in 


3. Address within that control section where the text is to be placed 


¢ Always a contiguous string of bytes 


— Discontinuities in the “text” stream start a new TXT record 


reece SHS pSNESPSPeesnpaaanio=ne=eseasarsbusnenoessnanannsescranrennianeannssosnsgrosmsansamnnsteanenasataasunrunneresonamasn=ntesaoannunanteennanasansasnananesonnoeeaseamanseeanmennn=anene=nnamananaateeend 
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10 


Object Module Relocation Dictionary (RLD) 


e Packed stream of 2-byte or 4-byte RLD items 


¢ Information about relocatable (and Q, CXD) address constants 
— Where the constant is to be found 
— What value should be in the constant (what it should point to) 


¢ Each RLD item has 6 pieces of information: 


1. R Pointer: ESDID of the name whose “target address” it should contain 
— l.e., what it points to 

2. P Pointer: ESDID of the section where the constant resides 
— l.e., where to find it 


3. Address: the address at which the constant resides within its section (as 
specified by the P pointer) 


4. Length: the constant’s length (in bytes) 


5. Type: whether it’s an A-type (data), V-type (branch), Q-type (PR offset), or CXD 
(PR “Cumulative Length”) 


Warning!! A- and V-type constants can be very different!! (More later...) 


6. Direction: whether the target address should be added or subtracted for A-type 
constants 


arc re enna eens eaeeseneree=fanoasanenenaalnerpyanesareasntannse=stsnessner=atasaensseinnse==ns=snepetnssaeassnes=rasanaspSseaenennenseneeaseeneysneniysaiasseaneannennsa==ssenassssnsnaatsaneanenenesnsseanseeet 
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Object Module Internal Symbol Dictionary (SYM) 18 


| ora SETI A A SPE ETT CT FST SoS SSS BS NE ECTS A AE SE SS SS SR | 
¢ Contains internal symbols used by source translator 
— Produced by Assembler, VS Fortran 
¢ SYM information is (sometimes) useful for debugging 
¢ Ghastly bit-squeezing packed format (details are truly impressive) 
_ — Maximum symbol length is 8 characters 


e Linkage Editor doesn’t make SYM records convenient to use 
— Copies SYM (and SD,CM info from ESD) records to front of load module 


— No system facilities for retrieving them easily! 


¢ Recommend using High Level Assembler SYSADATA output instead 


: . — More information, in a more usable format 

e 
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11 


Object Module End-of-Module (END) & 


eH ERSTE SS PE SESS AS SE SES RS SSN SEES | 


e Primary function is to signal the end of the object module 


¢ Some additional (optional) information may be provided: 
— Requested execution-time entry point 
— By ESDID and address, or by external name 


— These requests may be overridden by other factors or controls 


Actual length of a Control Section whose length was not specified on its ESD 
record 


— This feature saves effort in some compilers 
Identification (IDR) data (0, 1, or 2 19-byte IDR items) 
— Translator’s product number, with version and modification level 


— Date (YYDDD format) of the translation 


Sa IP SS SET LE SE ST SE EZ SS ESE SATE RF SE Se SE AEE 
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Other Object Module Records (CMS) 


rere reece ene ennnnnencane a AA A Nee pentnsnnenensnontnanner=aarsuun-sansuensstestelnmneans seennansenennareeeseeenenenennenmsanenenesneenrennatieanensaneend 


-¢ CMS LOAD has meager control-statement capabilities 
— Only ENTRY and LIBRARY statements 
° Object- -like records can be used for some control functions 


' REP Replacement text: behaves like a TXT record, but hex values 
| are specified in EBCDIC for ease of preparation 
—- Also used by the VSE Linkage Editor 


LDT Loader Terminate: last record of a group of object modules 
with optional indication of an entry address and SETSSI info 


ICS —_— Include Control Section: placed ahead of an object module to 
override the original length of a named control section 


SLC Set Location Counter: sets the (absolute virtual) load address at 
which the following modules will start loading 


SPB Set Page Boundary: sets the loader’s location counter to the 
next page boundary; may appear before/after any module 


e See the CMS LOAD command description for further details 


ENS SSE SSPE TE SSS FS EP ES SEY EE ERTS ES SU AE RA SAINTS 
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Combining Object Modules with 


the Batch Loader 


¢ A simple example of initiation-time linking 


e Illustrates the basic principles involved in linking 
— Applicable to CMS, also 


¢ It can do a lot more than this example shows 


Linkage Editing, Loading, Object & Load Modules SHARE 83, Session 4812 
© IBM Corporation 1994 10 Aug 94 


Combining Object Modules: a Simple Example 22 


¢ Suppose a program consists of two source modules: | 
Module lL 7 Module 2. | 


7 Loc Loc — — , 
. 000) MAIN 000 | SUB | 
- COMMON /WORK/ ... | COMMON /woRK/ ... 


EXTERNAL ZDATA 


CALL SUB i 7 
ae 700 


} == Addr (WORK} 
260 Addr (SUB) | 704 Addr (ZDATA) 
204 Addr (WORK) — ~-- | 
208) Addr (ZDATA) 


ENTRY ZDATA 


260) ZDATA 


(For this example, values 
are given in decimal) 


— Program MAIN contains a ZDATA entry point, and refers to the COMMON area 
named WORK , 


— Subprogram SUB refers to the external name ZDATA, and to the COMMON area 
named WORK 


¢ Translation produces two object modules 
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Combining Object Modules: First Object Module 


e The object module for Module 1 would look roughly like this: 


¢ ESD defines two control sections (MAIN and WORK), one entry 


ID=1 MAIN Addr=000 Len=300 
4 ID=2 WORK Addr=C00 Len=600 


10=1 ZDATA Addr=260 

1D=3 SUB 

I0=1 Addr=000 ‘abcedefghijk' 
ID=l ... etc. 

ID=1 Addr=100 ‘mnoparstuvw 
ID=1 Addr=260 '01234567890' 
IDzl ... etc. 

PID=1 RID=3 Addr=200 Len=4 Dir=+ 
PID=1 RID=2 Addr=204 Len=4 Dir=+ 
PID=1 RID=1 Addr=208 Len=4 Dir=+ 
Entry=MAIN 


SO for CSECT MAIN, ESDID=1, Len=300 
CM for COMMON WORK, ESDID=2, Len=600 
LD for Entry ZDATA, ESDID=1, Addr=260 
ER for reference to SUB, ESDID=3 
Text for MAIN, address 000 

Text for MAIN 

Text for MAIN, address 100 

Text for MAIN, address 260 

Text for MAIN 

RLD item for Addr(SuB) 

RLD item for Addr (WORK) 

RLD item for Addr(ZDATA) 

End of module, entryname=MAIN 


(ZDATA), one external reference (SUB) 


¢ RLD contains information about three address constants 


Sea EES YP SS AE ECE SSE SNE SS TS SPENT SES SE SESSA SRE LE A TE SE Do STAI 
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Combining Object Modules: Second Object Module 


_ a aL | 


° The object module for Module 2 would look roughly like this: 


ESD SD I1D=1 SUB Addr=000 Len=800 
ESD CH ID=2 WORK Addr=000 Len=400 
ESO ER 1D=3 ZDATA 


1D=1 Addr=040 
10=1 eee etc. 
ID=1 Addr=180 'gkizxcvbrm' 


1021 eee etc. 
~ PID=1 RID=2 Addr=700 Len=4 Dir=+ 


PID=1 RID=3 Addr=704 Len=4 Dir=+ 


‘qweruiopasd' 


SD for CSECT SUB, ESDID=1, Len=800 
CM for COMMON WORK, ESDID=2, Len=400 


-ER for reference to ZDATA, ESDID=3 


Text for SUB, address 940 
Text for SUB 

Text for SUB, address 180 
Text for SUB | 
RLD item for Addr (WORK) 

RLD item for Addr(ZDATA) 
End of module 
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24 


© ESD defines two control sections (SUB and WORK), one external 
reference (ZDATA) 


e RLD contains information about two address constants 
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Combining Object Modules: Batch Loader Actions 


e The Batch Loader 


1. Builds a single (“Composite”) ESD 


~ Merges ESD information from the object 


modules 


— Renumbers ESDIDs, assigns adjusted 


address values to all symbols 


. Places text in storage at designated 


addresses 


. Determines length of COMMON (retains 


longest length) 
— Allocates storage for it 


. Relocates address constants 
5. Sets entry point address 
6. Enters loaded program 


¢ Composite ESD 


ESDID| Addr 


Type) 
1 er 300 
01 123760, 
|123800, 800 
124600 600 


125200 
peel 
123500 


(For this example, values are 
given in decimal) 


¢ Suppose initial program load address 


is 123500 
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Combining Object Modules: Resulting Program 


* The resulting program, loaded into storage for execution: 


123500 (MAIN) 123800 (SUB) 
abcdefghijk... 
...more text... 
«+more text... 
mnopqrstuvw... 


oe text... 


“| ger bOX beac: 
oo COXEwee | 


| 3 aes «ew tOxt cack 
A(SUB) | 00123800 


00124600 
00123760 
(ZDATA) 
01234567890... 


qweruiopasd... 


jklzxcvbnm... 


125200 


124600 (WORK 
— (end) 


length=600 


A(WORK) 
A(ZDATA) 


¢ Storage is allocated for three control sections (two SD, one CM) 
e Address constants are resolved to designated addresses - 
¢ Entry point is at address of MAIN (123500) 
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Saving Linked Programs: Load 
Modules 


_ aaa | 
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What and Why are Load Modules? 


_ SL LT | 


e@ Basic executable unit for MV5S-like systems | 


— The world’s longest- “surviving form of ‘ “executable binary” 


. Designed for 


1. Loading into storage with minimal overhead | 
— Binary (zero-origin) PLOgham, image, ene oniy -elocation 
2. baiting 


Retains enough information to sermait 


— Replacement of any component 
— Restructuring of the entire module 
— Renaming of (almost!) any element 
— Unless you tell the Linkage Editor not to keep it! (NE option) 
3. Minimal run-time storage requirements 
— Only “necessary” items are in storage 
— Complex overlay structures are supported . 2 | | ° 


eee en creer Nonna ntanesennsssnenenn=nanananaena=nnantaanantsaassnnnnesnnshetnanatnasensnentnnerenet 
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What Is In a Load Module? 


[a Saar 


¢ Load module structure very similar to object module’s 
— Simplifies processing of each 


Basic contents (analogous to object module records) 

CESD Composite External Symbol Dictionary 

Text Machine language instructions and data 

RLD Relocation Dictionary 

SYM Object-module records copied directly into load modules 


IDR Identification records (from object modules, Linkage Editor, 
user, and ZAP) 


EOM End of module 

Additional items having no object-module analogs 

CTL Control records, for reading and relocating text records 
SEGTAB Segment table, for overlay structures 

ENTAB Entry table, for overlay structures 

EOS End of Segment, for overlay structures 


Cee anenelaeenipeentoneeenseionnnensaaeeaeosenennssoianpesre mspnvansoueensunesntonasanetiouniopesrqnimsnreascsatsntnesonestaneaiinsaaanniasecnontn ociainaiarentnessamennensinteisinpacansieeneisessnmneeaicareonrereneeasameaneiseainaalD 
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Schematic of a “Normal” Load Module 30 


- © Basic format is called “block format” or “block loaded” 


Only if TEST option, and SYM info is present in inputs 
(May also contain some ESD data) | 


CESD 


IDR | 
CTL Info about first text record 


rp 
jtext First text record, placed at module load address 
CTL/RLD Relocate first text record, and read second text record — 
at specified offset from module load address 


Last text record 


Omitted if NE option 


EOM/RLD Relocate last text record, signal end of module 


—* Location of first text record kept in PDS directory (TT R) 
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Linkage Editor Inputs and Outputs 


ESS ST ESP SE ES SE SESS oa EI STE SESS EGA SSS SSS SSE SY EC ST SS AE ES I SSE | 


¢ Inputs 
— Object modules 
— Load modules 


— Control statements to direct the Linkage Editor 


— Where to get additional inputs: 
INCLUDE, LIBRARY 


— What to do with all the pieces: 
REPLACE, CHANGE, INSERT, ORDER, PAGE, OVERLAY, EXPAND 


— How to describe and name the output module: 
ENTRY, NAME, SETSSI, IDENTIFY, SETCODE, MODE, ALIAS 


e Outputs 


— Load module(s) 


Listing, terminal messages 


aera mre beeen sacar omemrsennratetgnres aeons eneiops ann purSesai reteset ataasesiipeintanssto aioe tomsnantemmstaas titniousbeeienpensnsccaniiaunssaoscnabanemimeasenaascuamnmmanenieaetsaeaaamennaed 
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Linkage Editor Processing 32 


eS eNNeSeaaasenneteennanseeesaeeteSeannastesaneaaencneenesnnesetsasangeeneeanemsasssssnsosenseeersmaeed 


¢ Two-pass process (very much like an assembler!) 
e Pass 1 | 


Read all inputs (explicitly or implicitly designated) 

— If not NCAL, unresolved ERs cause library search (WXs never do) | 
Build symbol table (CESD) by merging ESD/CESD items from all inputs | 
= ‘Determine lengths, orderings, offsets, etc. 

— First SD wins, longest CM wins, all nonzero-length PC items kept, etc. 


e Intermediate processing 
— Resolve interdependences 
— Assign relative addresses 
_— Write module MAP (and XREF, if entire module is in sera) 
¢ Pass 2 | 


Write out all the pieces in the correct order, with relocation data 
— STOW directory entry (or entries, if ALIASes) | | 
— Write XREF (if module didn’t fit in storage) 


Cre erro one eens meeting orentenee noes taineesieenesensesaree ates semynnsen=naeasohasiseasionep==esntn=srapaanonasuesmsmasanpsmnemennansaneresamensntapamaaeennnaspesararpentemenaasantpsetannnennanasssanrsassaneeamanaaenstaaswaneememanesamesmnmeal 
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@ Pseudo-Registers 


Allow sharing by name in separately translated re-entrant programs 
PRs have their own name space 

— Separate from all other external symbols 

— PR names may be identical to other types of ESD name without “collision” 
PR items refer to offsets in a “link-time Dummy Control Section” 

— Hence the Assembler’s name, “External Dummy” (XD) 


— The dummy section is also called the “Pseudo-Register Vector” (PRV; up to 1024 
more “registers”) 


Resolved somewhat like commons: 


— But: no storage allocated at link time, as for commons 

— If multiple definitions, longest length and strictest alignment win 

— Accumulated length/alignment of PRV items then determine offsets 

— Offset value placed in Q-type address constants referencing PR name 
— Total size of the “link-time DSECT” is placed in a “CXD” adcon item 


Runtime code must allocate a storage area of the right ( (CXD) size 


Runtime references access fields at desired offsets in that area, using 
the Q-con contents for “displacements” 
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Peculiarities of Load Modules 34 


—eESEaEoEE=e=E=EeEeEaeaE==E=e=S=s=S=ssa=SsaouEx——eeeee—e==S———=_— a eE——eeeEe=—E——e—=S———————GCV————— 


SYM and IDR put at front of module, to simplify Link Editor logic | 
CESD is at front of module, to simplify re-processing of load modules | 
PDS directory info allows Program Fetch to skip this stuff. 


= First text record’s length and disk location; storage needed; attributes; etc. 


Small record sizes 


ma SYM < 244: CESD < 248; IDR, CTL. RLD < 256; Text < track length 

If first “real” text is not at relative zero, write a 1-byte record at zero! 
“Directory name space” (PDS directory names) unrelated to external 
(CESD) names (which may be unrelated to internal names, too!) 


— Can assign member and alias names unrelated to CESD names 
— Object module item named AA, renamed to BB in load module, PDS member is CC 
— Alice would be at home in this Wonderland! 


— TSS Linkage Editor didn’t allow this confusion 
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Overlay Modules 
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What and Why are Overlays? 


ec meena ee 


¢ Overlays are more complex than block-format modules 


— Different parts of a module may share we same storage 
— At different times, of course! — 


= Require special nneds Editor considerations | 


Pros: 
— Faster initiation: only part of the program need be loaded to start 
— Economical storage use: only load what’s needed, when it’s needed 


— Can always re-link.to block format if there’s enough storage 
— But: Behavior may be different, due to loss of re-initializations! 


Cons: 
— Programs cannot be shared (no re-enterability) 
~— More complex to specify, greater care needed in coding 
— Local data may or may not be “persistent” across calls 
— Distinction between V-type and A-type adcons is important 
— External data sharing may be more complicated 
Additional overhead in calls to segments needing to be loaded 
Calls among certain modules may be forbidden (or wrong) 


Cee rereeecesnaineaieseteintntenrdaeniesoebasnensaiehansieareasesyontanasarenshoaeareementoaesnaenasnerhasaiasaasensssstan/sasianso-sasusamasaenatanreasnnan an rarramaieeneeavemenensneeaaheemenaraseemenersomenaasanaeonaaumsiamen igre anemia ne nel 
Linkage Editing, Loading, Object & Load Modules SHARE 83, Session 4812 
€ 1BM Corporation 1994 10 Aug 94 


20 


Example of an Overlay Structure 


37 


SNR NN SSE NSN eense=n= INU =nennena====aSsan—sn=ns=nsennenannal 


e Suppose MAIN calls SUBA 
and SUBB 


— Neither calls the other 


In “block format,” they 
would appear in storage as 


MAIN 
CALL SUBA 
CALL SUBB 
SUBA 
Do SUBA stuff 
Return 


SUBB 
Do SUBB stuff 
Return 
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Arranging an Overlay Structure 


e Figure out what modules 
can share storage 


¢ Draw an “overlay tree” of 
the structure 


ca Structured as a tree, with 
root at top (low address) 


as Control statements describe 
desired structure 


— In this example, three 
overlay nodes: A, B, C 


¢ Root segment is always 
present _ | 


— Contains entry point, 

~ autocalled sections, 
Segment Table (SEGTAB 
tells what segments are in 
storage) 


Do SUBA stuff 
Return Return 


e SUBA and SUBB might be 
overlaid, like this: 


MAIN 
CALL SUBA 
CALL SUBB 


SUBB 
Do SUBB stuff 


SUBA and SUBB share the 
same storage 


e¢ The overlay supervisor must 
(help) make this work! 
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10 Aug 94 


| SEGTAB . | . _ 
Entry—>| MAIN Seg- 
Point ment 
AUTO1 | 7 . 
AUTO2 | . 
eS Overlay A ene 


SUBA SUBB 
SUBJ | | 


+— Overlay C.—» 7 


SUBV SUBW 
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An Overlay Structure In More Detail 39g © 
| SEE SST BSD ES A PS ST EC EE IE TES SEAESES | 


r 


| Root | seer | - @ Each segment with subsidiary 
cy segments is suffixed with an Entry 
lawle Se ie Table to assist loading of the 
“lower” segments 
r——— a(supa) | 


| | a(suga) /}-— — SVC instructions call Overlay 
’ [sve | 4 Supervisor 
——_+—__1—_—_—_ 


maid 
| suga | | suse | 
r—— v (suas) | | | 


as | e V-type adcons may resolve to an 
= WISER) — 


| -ivesuuy ca teas ry | ENTAB, not to the named symbol! 


ee negra L—{v (suey) | 
‘| sua |¢—— | |-— — V-cons for SUBs in lower segments 
---—-— | L_slEntag3 | Entry 


I! 
J oF 
 elewasa | entry | —fatsuon)] Table resolve to ENTAB ( ff) 


~--{a(su8t){ Table | | | A(SuBW) Fy 


|a(suaR) ky 4 I|sve |e A V-con for call in same or higher 
rfa(sues)| | aera Gar are segment resolves directly (Bb) 


| |sve | | | suey | suaw | 
| 


oe 
7 

| | | 

| ol ote | | 

| | [atsone) \___1 e A-cons always resolve directly 
| | 

| | 

| 

| 


Entry Table (ENTAB) 


| suas | | suBr | | 


visuar) fo | | 4 — Addresses in ENTAB resolve directly 


{| |v¢suag) to SUBs ( §]) 


t+——|_| 
lp Pm) _ j j 
| suet |? Ey Sections in same segment ( £9) 
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Bringing Load Modules into | 


Storage: Program Fetch . 
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r Program Fetch -- A Relocating Loader 


Used for all module loading from disk (LOAD, LINK, XCTL, ...) 
— Except during IPL... | 


Skip over everything preceding the first contro! record 


— SYM, IDR, CESD (PDS directory info makes the skipping simple) 
— Therefore, no linking! (CESD info has been ignored) 


Control records tell length and relative address of following text record 
— May also have RLD information for preceding text block 
A,V-cons relocated using only address information in RLD 


— Rand P pointers ignored; RLD information discarded after relocation 
— Q-cons and CXD were completed at linkage-edit time 


Note: two levels of relocation are involved: 

1. Linkage Editor adjusts addresses relative to module zero origin 

2. Program Fetch adjusts addresses relative to module’s “load address” 
Overlay Supervisor 

— SEGTAB and ENTABs manage segment traffic; Program Fetch loads segments 


_ EE a La. | 
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~ Looking Backward 
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Some History | ©} 
SSS PETER IDES SEE SEC I SS AS PTT ST ISS A EE TS A LS CE ST TSE | ~ 


ef. 


e Linkage Editor 
— Written in 1963-65 by small team in IBM Poughkeepsie 


— Program Fetch, Overlay Supervisor done at the same time 
— PDS’s, BLDL, STOW, etc. added to OS in response to LKED needs 


— Initial release ran in 18KB (32KB machines were big!) 


e OS Batch Loader 
— Written much later (about 1972) 
— Appeared with OS/360 Release 17 


¢ Very advanced technology for that time 


— Long ago, in a far away galaxy, ... 


EE SS PES SE A RP SSE eS OS eT TS SS a TE SEE TLS 
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Assumptions and Constraints on 1963 Designs 44 
rere escent ene een eennn ee SeapeNSenaSnSlEERaNSee==Snn=s=ensEes=nntnanp=ntsNEESSnst=pane=noesiasasassandstsenssnssnanpannes=sUssunssnsnssaacensmnasnseanssp=eeeannnnsraiseasanrersusnsanesrarnsssnansetenastceareaaeassant-cneamnsead 
© Early-binding philosophy: systems are expensive, people are cheap | 
— Programs run for long periods between needed changes 
— Therefore: recompile “deltas” and re-link them into the application module 
© Re-linking is cheaper than re-building from scratch | 
— Therefore: keep enough info within the module to make “editing” possible 
¢ DASD is slow, and central storage is precious and expensive 
— Therefore: short records are a good thing - 
— Therefore: packing module pieces tightly is a good thing 
— Therefore: overlay structures are a very good thing 
¢ 24-bit addresses and lengths are adequate for a very long time 
— Therefore: Everything must be smaller than 16MB 
— Therefore: AMODE and RMODE were “patched in” 
— Therefore: no “scatter loading” by RMODE; entry points don’t have own AMODE 


e 8-character upper-case EBCDIC names are adequate for a ven long — | . 
time | 


¢ Central storage is real (not virtual) 
a 
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Limitations and Extensions 


e Many current limitations that products must cope with: 


: — Short names, 16MB size, mono-modal modules, rigid formats, inadequate ESD 
types, no room for descriptive data, internal table limits, strange loopholes, ... 


¢ Some products invent “private” object formats, overload ESD names 
— Feed their output through a “pre-linker” ahead of the Linkage Editor 


Private Pre- Link Norma] 
| Compiler Linker Editor Load 


Module Module 


Additional Linking Step 


— Updates may force complete re-link from private objects 
— May have to “play games” to fool some existing tools (e.g. CMS TXTLIB) 
¢ We must consider new formats for translator outputs 
— Many languages need more function: C, Ada, Fortran-90, anything O-O 
SSE ST SE Se EE STE SS LS SSS SESS ECS SR NE TO 2B PS ON TTT 
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What Are The Problems? 46 


ee ——==========—eeEeeeE=——ee=S——=—= ————————e=—=————e————————————————— ll 


¢ One hardly knows where to Start! 
ie Some problems are generic, some are particular to each record type 
¢ General problems: 
— Fixed Bea of records and fields 


— 16MB size/length limits due to 24-bit length and address fields 


— Inefficient use of file space 
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What Are The Problems? ... 47 
ESR SR A ASE AE PSPS SOIC SP SE SE NSP A AEE CE I EE DLE ST MI ESI SESE: 
e ESD records: 
Long names are impossible to accommodate (without loophole games) 
16 MB size/length limit on everything 
Inadequate range of ESD types 


Mono-modal modules and entry points 
— Entry points in a CSECT can’t have different AMODEs 


No properties information 


— Is it really RENT? Movable? REFR? REUS? Read-Only? Is it R/O data (constants)? 
Pure code? Code and R/W data? 


No way to specify section alignment 
CM/PR “ownership” very muddled 
No data can be specified for CM items 


No attributes of modules or entries 
— Code? Data? (Should A or V point to this?) 


No way to provide descriptive data 
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What Are The Problems? ... 
eGeGeoouCllyjuuu”uq eee yy ———————EE 
TXT records: 
— Maximum of 70% utilization 
— No way to specify text attributes 
"= Is it code/data? Is it RO/RW/XO? 


— Do pieces have different RMODEs? 


— Can't specify initializations for holes/gaps 


— Can’t request data encoding or compression 


ener cnirereseesenteanliusnciraeeg snemasseosientntsoyeementorneevuonaianemsioneavaneyentowtauecinostesiamsiataaasusreesemanuinmassenaaeametamater oeioamsaaeiaaianeeneeaiaeeemnanoe eeaameaseensiannemeescanmeneaaeeeetenamanseaaniammamnenmene emma een 
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What Are The Problems? ... 


RLD records: 


— Available “type info” is often abused (or not respected) by coders 


— A-type and V-type adcons (mis-)used as essentially equivalent 
No checking is done between pointer/pointee 
Cannot specify addressing modes for pointers 


Cannot assign attributes for references 


— Eg. this is a pointer to data; to code; etc. 


No “extended attributes” to allow interface-conformance checking 


aaa LD 
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What Are The Problems? ... 


| aA aS TT aT A 


SYM records: 


Painfully comolex, hard-to-use data formats 


Symbolic names are truncated to 8 characters 

No XREF and reference information is provided. | 
No tie-backs from code and symbols to source statements. 
— No source statements are retained, either! 


— Writing listing-scanners is not a very good approach... 


END records: | 
— No way to specify entry point’s AMODE 
— Cannot specify more than one deferred length - 


_— No provision for richer (and more useful) IDR data 
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What Are The Problems? ... ©} 


e And then there are Load Modules: 


— Inherit all the shortcomings of object modules 


— Short names, single modes, 16 MB limits, etc. 


And add some new ones, too... 
— Peculiar module structures 
— Inefficient record sizes 
— When re-linking, some items are “sticky” 
¢« PCs with code, CM lengths, PR length/alignment, SYM, IDR, ... 
— System can’t LOAD SYM, IDR data even if you want to! 


e It’s amazing that all this has worked (somehow) for so long a time! 
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Looking Forward 
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é The DFSMS/MVS Binder and Loader 


Totally new product and new technology 

— Binder replaces Linkage Editor, Batch Loader 

— Loader replaces Program Fetch 

— Answers a very large set of customer requirements 

Fixes a vast array of usability and performance problems 

— Many new messages, added information, and detailed diagnostics 
— Almost all internal constraints removed 

— Linkage Editor is quirky, far too forgiving of errors, full of loopholes 
Supports a new “execution unit” —- a Program Object 

— Enhances performance, flexibility, integrity 


— Internal structure not externalized; data-access interfaces provided 


— Stored in PDSE’s, which fix almost all PDS problems (space, integrity, 
compression, performance, sharability, etc.) 


Base for future enhancements 
Available March 1993, in DFSMS/MVS 1.1 
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What We’ve Discussed 


Why program linking is a Good Thing 
What is in object modules, and where they come from 


How inter-module references are resolved to form an executable 
program 


What is in load modules, and how they are built by the Linkage Editor 


How load modules are loaded into storage and relocated 
Some history 


Where this technology is going 
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9. High Level Assembler/MVS & VM & VSE Programmer s Guide 
~ ($C26-4941) 


e These Assembler publications describe the most basic forms of language 
elements that create inputs to the Linkage Editor, Loader, and Binder. 
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